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CHAPTER I 



THE PROBLEM AND DEFINITION OF TEEMS 
Statement of the Problem 

The purpose of this study was to ascertain the effectiveness 
of a non-programmed, problem-solving method of teaching seventh-grade 
general science and that of a programmed, problem-solving approach 
through selected, open-ended, laboratory experiences. 

An environment for personal inquiry is being seen as a funda- 
mental characteristic of the science program at all levels. Inquiry 
is broadly defined as a search for truth, information, or knowledge. 
Teaching science as inquiry should show students how knowledge arises 
from the interpretation of data, that the interpretation of data 
proceeds on the basis of concepts and assumptions that change as our 
knowledge grows. Because these principles and concepts change, 
knowledge changes also inasmuch as we know more, and better, than we 
did before. The student should build up his store of knowledge in 
connection with the laboratory experience, rather than go through 
the laboratory experience merely to illustrate and clarify the 
knowledge which he has acquired elsewhere. 

Objectives 

The specific objectives are listed below: 

1. To compare the data for any significant gains in factual, 
subject-matter competency as measured by the factual- 
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oriented posttest. 

2. To compare the da 'a for any significant gain in the 
comprehension of scientific concepts, principles, and 
generalizations as measured by the conceptunl-oriented 
posttest . 

i 3. To determine the difference between the mean achievement 

of students using programmed materials and the mean 
achievement of students using the non-programmed approach 
as measured by the total posttest. 

4. To ascertain if there are differences between the effects 
of the two approaches concerning development of interest 
in science as measured by the Kuder General Interest 

* Survey (Form E). 

Hypotheses 

This experiment was based upon the substantive hypothesis 
that desirable change is brought about by the experimental treatment. 
Therefore, the substantive or research hypothesis that the experimental 
treatment will yield significant gain differences for the experimental 
groups, as measured by the posttest, is stated. In order to test a 
substantive hypothesis, however, it is necessaiy to present an opera- 
tional hypothesis or hypotheses in quantitative and statistical terms. 
The following operational hypotheses were tested in the null form: 

1* There is no significant difference in factual, science 
subject-matter competency between the control groups and 

‘ er|c 
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experimental groups as measured by the f actual-oriented 
posttest . 

2. There is no significant difference in the comprehension 
of scientific concepts, principles, and generalizations 
for students taught by the programmed., laboratory approach 
as compared with the gain for students taught by the 
non-programmed technique as measured by the conceptual- 
oriented posttest . 

3. There is no significant difference between the mean achieve- 
ment of students using programmed general science materials, 
and the mean achievement of students using the non- 
programmed approach as measured by the total unit posttest. 

4. There is no significant difference concerning development 
of interest in science for the students in the experimental 
group as compared with students taught by the non-programmed 
method as measured by the Kuder General Interest Survey 
(Form E) . 

Definition of Terms 

For the purpose of this study, the terms listed below have 
the following meanings; 

Concept — an idea including many separate percepts; a mental 
image of an event, a quality, or a quantity. 

Generalization — the application of a general concept to a 

relatively new object or situation; process whereby 
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one reaches a judgment applicable to a whole class 
often on the basis of experience with a limited 
number of the class. 

Interest — an individuals preference for an activity in 
reference to his degree of liking for alternate 
activities as measured by a standardized inventory 

i 

such as the Kuder General Interest Survey (Form E) . 

Scientific Interest— when capitalized, refers to the Scientific 
scale of the Kuder General Interest Survey (Form E) 
or the student 1 s score on this scale. 

Science achievement — the number of correct responses on a test 
designed to assess the subjects understanding of 
the science subject matter, and the concepts, 
principles, and generalizations relevant to the 
respective curriculum pertaining to this study. 

• • Intelligence Quotient— the deviation I.Q. taken from published 
norms of the California Short-Form Test of Mental 
Maturity . Junior High Level . 

Reading level — raw score values for both reading vocabulary 
and reading comprehension as assessed by the 
California Achievement Tests , Junior High Level . 

Science average— a value derived by assigning number values 
to the five-letter grading scale as follows: A = 4; 

B = 3; C = 2j D = 1; F = 0. For this study the science 
average was obtained from the two quarters of the 
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seventh grade of the current academic year, 1965-1966. 
ah? lity level — the I.Q. range from 75-89 of the population 
under study as measured by the California Short - Form 
Test of Mental Maturity , Junior High Level . 

Low average ability level— the I.Q. range from 90-104 of the 
population under study as measured by the California 
Short-Form Test of Mental Maturity , Junior High Level . 

High average ability level — the I.Q. range from 105-119 of 
the population under study as measured by the 
California Short-Form Test of Mental Maturity , Junior 
High Level . 

High ability level — the I.Q. range from 120 up of the population 
under study as measured by the California Short- Form 
Test of Mental Maturity , Junior High Level . 

Stimulus— statement , rule, principle, or example presented in 
a manner designed to evoke a specific response from 
a subject. 

Response — reply or reaction by a subject to a stimulus. 

Frame— a unit in a programmed sequence arranged in a particular 
pattern of stimulus, response, * and reinforcement. 

Programmed instruction — learning content arranged in frames 
and presented in a sequence designed to elicit 
specific responses, followed by immediate reinforcement. 

Linear program — program in which learning content is divided 
into small increments so arranged that there is 
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minimal' possibility of subject errors, 
problem-solving approach — an orderly method of applying 
inductive and deductive methods of reasoning to 
perplexing, open-ended, laboratory experiences. 
Open-ended experiences— —the open— endedness and inquiry value 
of these experiences appear mainly in the form of 
the questions that are asked in the body of the 
directions. These questions lie at four levels; 

(a) those which can be answered from the experience 
at hand, (b) those which can be answered only after 
considerable investigation and thinking, (c) those ‘ 
which cannot be answered with certainty by the student 
or anyone else, (d) those which probably cannot be 
answered at all with our present state of knowledge. 
Also under further investigations suggested at the end 
of each of the programmed lessons, the experience is 
open-ended in that the student may apply, indefinitely, 
the principles learned to extensions of the same idea. 

Non-programmed method — a method whereby students are taught 

by a problem-solving, lecture-demonstration technique. 

Programmed method — a method whereby students are instructed by 
means of a programmed, problem-solving, open-ended, 
laboratory approach. 
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Treatment — administration of the programmed sequence or the 
non-programmed instructional procedures. 
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Factual-criented test— a test with items designed for recall 

of facts. 

0 

% 

Conceptual-oriented test — test items designed for measuring 
• • • 

a student 1 s knowledge of certain concepts, and their 
ability to apply concepts to a relatively new situation. 
Total unit test — the combined f actual-oriented and conceptual- 
oriented test items. 

Basic Assumptions 

This study is dependent upon the following basic assumptions; 

1. That the California Short-Form Test of Mental Maturity , 

Junior High Level provides reasonably valid measures of 
the student r s language ability and non-language ability. 

fu m . f , 

2. That the personal information provided by the student 1 s 
permanent record folder is correct. 

3. That student responses to an evaluative questionnaire 
are valid indications of their likes and dislikes 
concerning programmed materials. 

4. That the students* science teachers* assessment of their 
performance, in the former quarters of the seventh grade, 

is a valid indication of their performance level in science. 

Significance of the Study 



o 

ERIC 



In teaching science, a continuing area of concern to educators 
has been the problem of integrating laboratory experimentation with 
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scientific theory. The need to individualize course content for 
students is also recognized. Programmed instruction may offer one 
solution to both these difficulties. If the science curriculum can * 
be programmed so that individual rates of learning can be dealt with 
more effectively, and if laboratory materials can be developed which 
enable students to conduct experiments efficiently on an individual- 
ized basis, laboratory experimentation will be integrated more 
satisfactorily into the typical science course. TWhat is generally 
considered an advantage of programmed instruction is the systematic, 
controlled, and sequential fashion in which content is presented to 
the learner. In programmed instruction provision can be made for 
differentiating instruction to accommodate individual differences 
(Fry, 1959 ; Stolurow, 1961 ; Travers, 1945 ). It is the consensus of 
science educators that laboratory exercises must be incorporated into 
the programmed sequence in order to receive full benefit of those 
advantages in science courses. 

A survey of the subject areas in which programmed instruction 
has been used reveals that those areas which require laboratory-like 
activities are rare (Hanson, 1963). To the writers knowledge, this 
investigation should be unique in that no programmed, open-ended, 
laboratory experiments on the junior high level have been developed 
and tested previously. The conclusions from this study should aid in 
evaluating the future program of seventh-grade science instruction, 
and; concomitantly, should demonstrate what programmed, problem-solving, 




open-ended, laboratory experiences could contribute, if any contribution, 
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to the whole spectrum of science disciplines. 

A student 1 s laboratory exercises should constitute experiences 
which serve to help him discover and explore the principles being 
studied. This study seeks to obtain evidence relevant to this problem 
by investigating the results of using instructional materials which 
were designed to be an exploratory experience dealing with problems 
real to students. 

The results of studies of programmed materials in general 
are somewhat contradictory * The findings of this study will provide 
additional insight into the effectiveness of programmed materials 
as an instructional method. 

The results of this study should be of value to educators 
who are concerned with individualized instruction for upper elementary 
and/or junior high school students. 

Limitations of the Study 

Some limitations of this study are: 

1. The subjects included in the study were limited to those 
students drawn from the seventh-grade population of the 
Clarke County School System, Athens, Georgia, for the 
academic year, 1965-1966. 

2. The .study was limited to a total population of 221 students. 

The four sections of experimental groups numbered 106. 

% 

The four sections of control groups numbered 115. 




3. 



Due to the absences of the subjects on testing days, 
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incomplete data from cumulative records, or a student* s 

lo finish the experimental lessons, some students 

were of necessity dropped from the study. 

/ * 

4. For administrative reasons, intact classes of seventh 
graders had to be used for this study with the result 
that groups could not be matched or randomly assigned at 
the beginning of the study. However, the statistical 
model uc-ed compensated for original differences in the 
groups in achievement and intelligence. 

5. As has been cited by many investigators employing the 
experimental group-control, group design in research, the 
Hawthorne effect was potentially a contaminating factor. 
The achievement of students who were in an experimental 
group was affected by the mere fact that they were members 
of an experimental group, it was also likely that members 
of the control groups were aware of the experimental 
nature of the study. The exact effects of these realiza- 
tions can not be isolated, but the existence of these 
effects must be recognized. 

6. The only measure of student understanding of science in 
this study was the posttest instrument designed by the 
investigator. 

Organization of the Remainder 
of the Study 




This chapter has presented a statement of the problem, its 
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objectives, hypotheses, definitions of terms, basic assumptions, and 
limitations . 

The second chapter presents a review of related research, 
which includes a review of the development of programmed instruction, 
studies dealing with programmed material, reviews of programmed 
science materials, and research dealing with programmed laboratory 
experiences. The third chapter gives descriptions of the subjects, 
the experimental design, the program, description of the treatments, 
administration and duration of the experiment, sources and types of 
data, a description of the testing instruments, and the types of 
statistical analyses used. 

The fourth chapter presents results of the study; the fifth 
chapter includes a discussion of findings and conclusions. A. summary 
and implications for further research are contained in the sixth 
chapter. 
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CHAPTER II 

REVIEW OF RELATED LITERATURE 

Shortly after this century passed the half-way mark, followers 
of educational research and industrial training began to observe the 
emergence of a new method of learning. Actually, it was not new; it 
had its beginning even in the days of Plato and Socrates — and perhaps 
before. Attention Was being drawn by the novelty of this method 
inasmuch as it was diametrically opposed to the group lecture method 
monopolizing the classroom of the twentieth century. This method was 
programmed learning, regarded as a recent development because its 
practical application to education has become apparent only in the 
past few years. 

Because of its recency in entering the educational stage, the 
literature related to the development of programming is herein reviewed 
as a general background for this study, as well as its implementation, 
developments concerning instruction in the sciences, and research 
related to programmed laboratory experiences. The first section 
presents a summary of the development of programming. The second 
section preflenta a review of renoarch of programmed instruction as 
an educational medium. The third section consists of reviews of 
programmed science materials; the fourth encompasses the available 
literature related to programmed laboratory experiences. 

The Development of Programmed Instruction 




One of the earliest programmers was Socrates, who developed a 
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program for geometry, which was recorded by Plato in the dialogue, 
Meno. It was Socrates* method to guide his followers to knowledge 
by conducting them conversationally along a path from fact to fact 
and insight to insight. His method could merit being termed "oral 
programming 1 * which Banks (1963) has employed at the United States 
Air Force Academy in a course in aerodynamics using closed-circuit 
television. Each student has a switch which he uses to indicate his 
answer to key questions asked by the instructor throughout the lecture. 
The instructor has a device which records the number of correct 
answers and allows him to gauge the class* s understanding of each 
important point. A workbook of programmed exercises follows each 
television presentation to reinforce the lecture. 

Closer to our own times has come the tutorial method. This 
was perfected by the colleges of the great English universities and 
. taken up by many of this country* s colleges in one form or another. 

The continuous exchange of questions and answers between the tutor 
and his student, the unfolding of information and explanations, and 
the constant selection of new materials on the basis of t.he student* s 
mastery of what has gone before is indeed a forerunner of programmed 
instruction. 

The antecedents of current programming practices are many$ 
however, the method as it is now employed may be considered to derive 
from dynamic efforts begun in the 1920* s (lysaught and Williams, 1963). 

In 1926 Sidney L. Pressey, an Ohio State University psychol- 
ogist, made public his pioneer studies on the first recognized 



o 

ERIC 



14 



teaching machine (pressey, 1926). His model originally was conceived 
as a testing machine that presented a series of questions to a student 
and informed him immediately whether his reply was right or wrong.* 

The machine was essentially a box containing a revolving drum; the 
program was exposed through a slot in the box and the student indicated 
his response by pressing one of four buttons. The correct button 
released the drum and permitted the student to move on to the next 
question; an incorrect response did not release the mechanism. 

Pressey thought this principle to be useful; the student always found 
the correct answer, and as the correct answer was always the last 
answer given, it made the strongest impression on the student. By 
a simple adjustment, the machine could be used for testing; it merely 
progressed to the next question without regard to the students 
response (Pry, 1963). 

Even though Pressey 1 s devices initially emphasized the automa- 
tion of testing rather than of instruction, it was evident that they 
incorporated principles of learning enunciated by psychologists— the 
major emphases being on active participation, immediate confirmation, 
and individual progression adapted to the capabilities of individual 
learners (Hilgard, 1964). 

Despite the promise of Pressey 1 s experimental results and 
his interest, along with others, in encouraging further research, 
the movement in behalf of such devices lost its impetus. There are 
perhaps at least two explanations for this loss of impetus. First, no 
provision was made for systematic programming of materials to be used 
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in these machines, and second, the onset of the depression and its 
impact on social conditions and education apparently offered an 
unfavorable environment for such a revolution in the nations schools. 

* X 

By 1954, however, conditions in the world ‘had changed. Greater 
knowledge in classroom behavior, especially about the analysis of 
learning behavior, and a demand for education on an unprecedented 
scale, created a vastly different set of circumstances. In this 
fresh climate, two Harvard psychologists, B. F. Skinner and James G. 
Holland, devised auto-instructional methods which have served the 
current generation as the bases for its own excursions into pro- 
grammed learning. 

Skinner proposes that reinforcement is one of the essential 
processes in an instructional program. There is acknowledgment or 
reward of some kind to show that an organism has performed to 
satisfaction. Reinforcement theories have sprung from laboratory 
observation and analysis of the learning behaviors of lower organisms. 
However, a projection of their conclusions to human beings provides 
new insight into the educational problems of teachers and psycholo- 
gists alike. 

Considerations of Skinner supplied the ingredient missing 
from Pressey T s experiments. It provided the idea of programming 
instructional materials that Pressey 1 s machines needed. This theory 
has the further merit of complementing a body of educational beliefs 
long utilized in classroom teaching. The idea of reinforcement 
provides a rationale for believing that a complex body of learning 
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car, be separated into its smallest components. In any case, programmed 
learning does not break completely from previous methods of teaching. 

Historically, the term "program", as applied to a sequence of 
instruction presented by a teaching machine, derives from the 1954 
paper of Skinner, whose influence has guided the mainstream of 

developments in programmed instruction during the 1950’ s and 1960»s 
(Skinner, 1954). 

. Hilgard (1964) has defined a "program" as follows: An 
instructional program is a vehicle which generates an essentially 
reproducible sequence of instructional events and accepts responsibil- 
ity for efficiently accomplishing a specified change from a given 
range of initial competencies or behavioral tendencies. This defini- 
tion can encompass most of the forms of programs that have been 
proposed. It makes no particular theoretical presuppositions and 
does not require individual progress or overt response by the learner 
as part of the definition. The definition does imply a programmed 

sequence of learner behavior,' not merely a reproducible set of 
stimulus materials. 

The linear, or extrinsic, programs present a sequential 
development of the material through which each student, regardless 
of his response, proceeds in exactly the same order. The student 
responds to the first item, and then, to the second item. He goes 
on to the third and subsequent items in linear form irrespective 
of any errors he might make on intervening items. 

The constructed-response program is associated chiefly with 
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the work of psychologist B. F. Skinner and his associates, including 
James Holland. They believe that the absence of errors contributes 

to more efficient learning, as a correct answer is in effect a 

* * 

“reward" or “reinforcement" for the student; that is, knowledge 
that he has answered correctly is stimulating and rewarding to the 
student, and an incentive to greater effort or continued effort. 

The second basic type of program is the so-called branching 
or intrinsic program developed by Crowder ( 1959 ). In the branching 
program, the student is presented with a problem and with several 
alternative answers, one of which is correct. When the student 
chooses an answer he is instructed to move to a specified frame. 

This frame then tells him if his answer was incorrect and explains 
why it was incorrect. The frame then may return the student to the 
original item which he had answered incorrectly for another trial, 
or it direct him through a subprogram — further instructing him 
in the basic knowledge presumed to be necessary for the item he had 
answered incorrectly. In either case, the student is eventually 
returned to the missed item which he then, presumably, answers 
correctly. If he again chooses an incorrect alternative, a similar 
process is followed. Ultimately, he is returned to the missed item 
and answers it correctly. He is then directed to the next frame in 
the program where the same process may be repeated should he answer 
that item incorrectly. 

Branching programs are published in the form of scrambled 
textbooks when the publication is in the form of a book. The scrambled 
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textbook arranges the frames and alternative answers to frames in 
such a way that the student is directed to search through the book to 
proceed to the next step. It does not move sequentially, page by 
page, as does the linear program (Green, 1963). 

Research Related to programmed Materi^ O. 

Literature referred to in this section is representative of 
investigations related to programmed material. 

Coulson and Si3berman (I960) gave branching and linear programs 

i 

to junior college students and found the linear, constructed-response 
group had a higher mean score on the constructed-response section of 
the test, but there was no difference on the multiple-choice test. 

Fry (I960) used a linear program with either multiple-choice 
or constructed responses to teach Spanish words and phrases to ninth- 
grade pupils. The criterion test consisted of both types of items. 

The conclusion reached from the data was that the constructed- 
response method was superior for recall. 

Larkin (1964) experimented with sixty children aged fourteen 
years in using programs. No significant difference was found between 
linear and branching methods of an immediate posttest consisting of 
both composed and recognition items. Separate analyses of each section 
of the test found neither program superior for recall or for recog- 
nition. In addition there were no significant differences between 
ability levels on either part of the test even though average 
I.Q.s for the extreme groups differed by twenty points. Sixteen 
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weeks later the same test was administered again. Analysis of the 
full test scores showed no advantage to either method of programming; 
ability levels, were, however, significantly different on this 
occasion. 

In a second study, Larkin and Keith ( 1964 ), experimented to 
ascertain whether younger pupils learn better from a linear, 
constructed-response program or from a branching, multiple-choice 
program. It was predicted that less mature pupils would learn and 
retain more after using the constructed-response program. This 
hypothesis was confirmed. Analysis of variance indicated that the 
methods were significantly different. The significant ability-by- 
method interaction showed that the branching program is less 
effective with the lower ability groups. 

Analyses of the scores on the constructed-response and also 
on the multiple— choice items were carried out. Thus the conclusions 
of Fry, Coulson, and Silberman, referred to earlier, that a constructed- 
response method results in higher scores on a constructed-response test 
than does a multiple-choice method were confirmed. 

Keislar (1959) investigated the relationship between intelli- 
gence and programmed materials, correlating mental age with student 
achievement in a linear program. The course content was based on 
elementary concepts concerning rectangles. Fourteen experimental 
subjects were selected from fifth and sixth grades and matched for 
intelligence, sex, reading ability, and pretest scores. The rank 
order correlation coefficient among the variables was reported as 
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modest. 

Shay (1961) examined tne relationship of intelligence, in 
respect to the probability of obtaining a correct response, to achieve- 
ment. Ninety subjects were selected from the fourth grade on the 
basis of scores on a group intelligence test and assigned to three 
groups* above average, average, and below average. The programmed 
content was a fourth-grade unit in Roman numerals. The results of 
Shay T s study indicate that differences in achievement occurred 
consistently among the three intelligence groups, with the above 
average group achieving at the higher level and the below average 
group achieving at the lowest level. 

McNeil and Keislar (1961) examined the relationships of sex 
and intelligence on achievement in programmed instruction. Sex 
differences were reported in favor of boys. The correlations of 
intelligence and achievement ranged from 0.40 to 0.89. 

A positive relationship between intelligence and mathematics 
achievement was reported by Andrews (1963) utilizing programmed 
instruction with elementary school age subjects,, Andrews found that 
girls consistently exceeded boys in achievement during programmed 
instruction. 



Research Related to Programmed Science Material 

Pollard (1964) using a card deck unit in biology in his 
general science classes, reported that when two groups of students, 
one using the card deck and the other exposed to conventional methods, 
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were tested, there was a significant difference in favor of the 

experimental group, scores showed a difference of 11 . 5 points, with 

the experimental group having a mean 15.8 percent higher than the 
mean of the control group, 

Wong (1964), on the secondaiy level, described the processes 
in producing three units in a biology class, but no data were 
reported as to their implementation. 

On the college level, Lawson (I960) presented data on a one- 
week, scrambled-book, genetics unit. The sample consisted of ten 
sections of students registered for a natural science course. Half 
the sections were used as an experimental group and the other half 
as a control. A thirty-item objective test was administered which 
was designed to measure the transfer of reasoning analysis and 
application. The means and standard deviations of the scores were 
determined for each group with t-test treatment. Results indicated 
the difference in performance- was significant beyond the 0.01 level 
in favor of the experimental group, 

Raush and Cornell (1963) tested an edition of a program on 
molecular equilibrium. The program was finished in a time range of 
from three to twelve hours, with an average of six hours, by 300 
college chemistry students in a general college chemist^ course. 

The investigators reported an increase in grade average as a result 
of the programmed materials. 

R e search R e lated to Programmed Laboratory Experiences 

Wash (1964) conducted an experiment in the use of programmed 



materials in teaching high school chemistry. Investigation relative 
to difference in chemistry achievement between the control and 
experimental group, the variability in time spent in the instructional 
program, and the administrative feasibility of secondary students to 
conduct individual experiments as a regular part of .he programmed 
sequence was performed. The investigator reported that students can 
achieve at significantly higher levels through the use of a 
programmed chemistry sequence which includes laboratory exercises, 
and that there is a positive relationship between time in the 
programmed sequence and achievement. Also concluded was that the 
use of programmed instruction in laboratory science courses is 
feasible and presents no unusual administrative problems. 

A three-phase study has been completed and reported by Hedges 
and MacDougall (1965) regarding the teaching of fourth-grade science 
by means of programmed science materials with laboratory experiences. 
In the first phase of the investigation, a pilot study, the achieve- 
ment of those students using programmed materials during the investi- 
gation was significantly higher at the 0.05 probability level than 
the achievement of those who were taught by more conventional methods. 

The primary purpose of Phase II was to write and refine through 
field testing and analysis, a complete course of science study for 
the fourth grade, using Skinner* s small-step approach. A year-long 
use of these programmed materials revealed no significant differences 
in achievement. 

The efficacy of the programmed learning approach was investi- 
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gated in a more controlled situation, described by Phase III. Nineteen 
fourth-grade classes were randomly assigned to each of three treatment 
groups. The first group of six classes of students studied science 
for one semester using the programmed materials with each child doing 
the experiments. The second group of six classes of students studied 
sciejice for one semester by means of the programmed materials coupled 
with the students only reading about the experiments; the third group 
of seven classes of students studied by means of the programmed 
materials rewritten in textbook form with fixed assignments by the 
teacher and teacher-demonstration of the experiments in the emulation 
of the conventional approach to science teaching. The three groups 
were compared on the instructional effectiveness of the method of 
science instruction, where instructional effectiveness was defined 
as (a) achievement, (b) interest and attitude, (c) value of laboratory 
experiences, and (d) retention. The results of the post-experimental 
testing revealed no significant mean differences were found among the 
three treatments in initial intelligence and science achievement, 
final measures of science achievement, level of interest in science, 
and measures of retention. 

A bulletin has been published by the Michigan Department of 
Public Instruction (Michigan, 1965) presenting open-ended, laboratory- 
centered, science experiences for grades seven, eight, and nine, but 
the literature reveals no published research involving the use of 
these experiences. The Committee on Science responsible for these 
laboratory-experience outlines believe that it is possible to select 




ideas or understandings of an interdisciplinary nature, which can be 
introduced at the junior high school level or lower, and which will 
continue to be valid in progressively more sophisticated fashion at 
each succeeding level to graduate school and beyond. 

A subcommittee, chaired by Professor W. C. Van Deventer, 
Professor of Biology, Western Michigan University, is responsible for 
having implemented a few ideas of the type indicated. The work is 
an on-going project; presently over twenty laboratory-experience 
outlines have been completed. 

There is reported in the literature, research completed 
under the auspices of the Department of Defense, U.S. Air Force, 
certain laboratory-like activities. Some American industries have 
had under study, programming of certain mechanical procedures. 

Cohen and Fetterolf (1963) developed an adaptation of programmed 

instruction for a laboratory course in qualitative analysis. Theory 

is given in lecture but the twenty-four laboratory hours are structured 

so that after necessary experimentation, confirmation of the students' 

* 

answers is given by a programmed guide. No attempt has been made to 
evaluate results by statistical analysis. 

No other reported researches in the area of programmed 
laboratory instruction were discovered by the investigator. 

This chapter has summarized the development of programming, 
research related to developments of programmed instruction in the 
sciences, and research relevant to programmed laboratory experiences. 

The development of programming dates back to the teaching 
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method used by Socrates. The continuous exchange of questions and 
answers under the tutorial method is a forerunner of programmed 
instruction. Efforts by S. L. Pressey in the 1920* s were probably 
the beginning of the method of programming as it is now employed. 

B. F. Skinner and J. C. Holland devised auto-instructional methods 
which have served the current generation as to the bases for its 
excursions into programmed learning. Linear and branching techniques 
are the two basic types of programming. 

In the studies cited the linear or constructed-response 
technique was superior for recall with the exception of the study 
by Larkin (1964) in which it was reported that neither the linear 
nor the branching method was superior. Intelligence was found to 
be related to achievement in programmed instruction. The findings 
concerning the relationship of sex to achievement in programmed 
instruction tend to favor the boys. Data reported for studies which 
were related to programmed science material showed significant 
achievement differences in favor of the experimental groups. 

Few research projects involving programmed laboratory science 
experiences were found. Among those cited, only two reported data 
analyses. One investigation revealed a significant difference in 
achievement favoring the experimental group. The other study reported 
no significant differences were found in achievement or level of 
interest in science. 

In the following chapter the procedure for implementing the 
research project is discussed. Descriptions are given of the 
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experimental design, the subjects, the program, the treatments, the 
sources and types of data, the testing instruments, and the types 
of statistical analyses used. 
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CHAPTER. Ill 



PROCEDURE 

Experimental Design 

For the purpose and objectives of this study, research design 
number 10 proposed lay Campbell and Stanley (1963) was considered 
appropriate, since the groups constitute naturally assembled 
collectives. The investigator did not control the selection of the 
subjects, but in the use of this design it is assumed that the 
experimental treatment can be assigned randomly to the groups. 

Table 1, page 28, depicts the research design incorporated 
by the investigator. 

The experimental schedule followed in executing the research 
plan may be found in Appendix D. 

Subjects 

The subjects in this study were samples of the population of 
students enrolled in the seventh grade of the Clarke County High 
School during the academic year, 1965-1966. The total seventh-grade 
population was assigned to four academic teams of teachers. Each 
team consisted of four teachers who instructed students in respective 
areas of English, mathematics, history, or science. The team-student 
population was divided into four sections, with each section consisting 
of approximately 30 students. The investigator used subjects from 
two academic teams, one population with N = 118, and the other with 
N = 103. A total of eight sections were involved with N = 221. 



Research Design 
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Two science teachers from two academic teams participated in* 
this study. Both cooperating teachers instructed two experimental 
and two control groups. 



The Program 

A programmed unit in seventh-grade science of an inter- 
disciplinary nature was developed by the investigator. The investi- 
gator secured permission from Professor Van Deventer, chairman of 

the Michigan Subcommittee on Science which published a bulletin of 

* 

open-ended, laboratory experience-outlines referred to earlier 

(Michigan, 1965), to use a selected number of the outlines. These 

outlines served as the framework for the programmed sequence of 

instructional material. Six outlines were selected under three 

main ideas. Listed below are the three main ideas and the titles 

\ 

of the six laboratory-experience outlines: 

1. The idea of gradients : there are no sharp boundary 

lines in nature. 

a. A Gradient.* The Effect of pH on Yeast Activity. 

b. A Gradient for the Separation of Chemical Substances: 
Paper Chromatography. 

2. The idea of normal curves: related phenomena tend to 

vary in pattern fashion around a norm. 

a. Normal Distribution Curves Describe Variation 
in Nature. 

b. The Use of Normal Curves in Distinguishing Species. 

3. The idea of measurement as an expression of relationship, 
a* A Study in Measurement. 
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b. Measurement as an Expression of Relationship: 

A Simple Balance. 

The programmed unit of instruction, covering the six lessons, 
each having the respective laboratory experiment as the core learning 
experience, consisted of 590 modified linear frames. A copy of 

these six lessons comprising the unit of instruction may be found 

» 

in Appendix A. This unit of six lessons was used as the sole 
instructional device with the experimental group. 

The linear mode of programming was modified so as to allow 
certain frames to become open-ended. This procedure allows students, 
through a series of inquiries, to gather data, interpret observations, 
and arrive at conclusions based on such data and observations. 

The combination of laboratory instruction and a programmed 
sequence poses some special problems. Particularly, the concept of 
an immediate reinforcement becomes difficult if laboratory experiences 
are to be meaningful. "When laboratory exercises consist of small 
sequential steps with frequent responses, followed by immediate feed- 
back to the pupil, it becomes difficult to allow the student to 
gather evidence so that conclusions will represent insights he has 
reached for himself. The frames containing the laboratory exercises, 
while guiding the student to draw an independent conclusion, attempt 
to assure his reaching a reasonable conclusion based on experimental 
evidence . 

The content of the program involved those items of information 
necessary to an understanding of the idea of a gradient as demonstrated 
in varying pH environments of yeast, or in paper chromatography; the 
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idea of* normal curves and sampling as evidenced by variety in nature 
or in the overlapping characteristics of related species of elm leaves 
and the idea of measurement as an expression of relationship as 
demonstrated by a study in measurement from a microscopic quantity 
to a quantity such as a portion of the school grounds, or by a simple 
balance which called ior the student* s establishing his own system of 
measurement. The desirability for this information was identified 
through pilot studies as the program was being written. 

The programmed sequence was developed during the period from 
June, 1965 to December, 1965. During this time the individual lessons 
in the unit were tried with students from a selected sixth grade in 
Clarke County, Georgia. It was felt that the problem of difficulty 
level of the respective frames could be assessed more confidently if 
students one grade level lower than the experimental population 
were not handicapped in performing the laboratory experiences. 
Sixth-grade students of differing ability levels were used. Each of 
the six lessons was revised after feedback from the students 
involved in the pilot study. 

Before the six programmed lessons were tried on the sixth- 
grade students, all of them were submitted to a program specialist, 
science educator, and a specialist in each of the six fields (e.g., 
a microbiologist, a chromatographer, etc.) for editing. Any 
suggestions for revision were followed prior to their submission to 
the sixth-grade students. Professor Van Deventer assisted the 
investigator in editing all of the six programmed lessons, the 
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testing instruments, and all of the materials involved in this 
experimentation . 

After the individual pilot trials were completed, a final 
field trial was conducted from January 3, to January 26, 1966, at 
Pattie Hilsman Junior High School in Clarke County, Georgia. To 
eliminate possible contamination of the seventh-grade popu la tion, 
this school was selected because of its geographic separation from 
the selected school cooperating in the formal experimentation. The 
investigator taught an experimental group with N = 31, and a control 
group with N = 31 selected from the seventh grade population of 
Pattie Hilsman Junior High School. The pretest instrument was admin- 
istered to both control and experimental subjects at the beginning 
of the fieid trial. At the end of this period, the posttest was 
given to both groups; an evaluation questionnaire was administered 
only to the experimental groups. A copy of this evaluation 
questionnaire may be found in Appendix B. 

The purpose of this final field trial was to identify the 
frames in which students consistently made errors or could not supply 
correct responses. Also, the investigator evaluated each question 
posed by the students to detect any faulty frames in the preface 
frames or in the procedural directions given for the performance of 
the experiment. Four of the six lessons were revised as a result 
of this final field trial. Other invaluable information was gleaned 
from this experience which contributed to a much smoother operation 
once the formal experimentation was begun. 
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The two seventh-grade science teachers at Clarke County Junior 
High School, who cooperated in teaching the eight sections of expert-’ 
mental and control groups, reviewed the six programmed lessons and 
performed the laboratory experiments prior to the formal experimenta- 
tion period. 

The modified Dale-Chall Readability Formula was used to 
determine the readability level of the materials. The readability 
level was found to be 5.6. 

Description of the Treatments 

Before initiating the experiment the investigator met with 
the two experimental and control teachers to explain the overall 
project and the teacher* s role in the study. In the experimental 
sections the teachers were instructed to use the programmed laboratory 
experiences for the regular instruction. The experimental subjects 
were free to use textbooks, periodicals, and other resource material 
to supplement the programmed texts; but as the unit developed, no 
supplementary materials were used. The two teachers were instructed 
not to use any materials for total classroom instruction other than 
the programmed, laboratory materials with the experimental subjects. 

The teachers were instructed to teach the control sections in 
their usual manner , using the curriculum materials provided by the 
investigator. For the control classes the teachers were provided 
with the content of the programmed units and with assistance in 
obtaining materials and preparing demonstrations in a more conventional 
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approach to teaching science* Six science worksheets, with questions 
comparable to those posed within the six programmed lessons, were 
provided for the teachers. The approach was still one of group 
problem solving; the principal special instruction for tho teaching 
of the control groups was that no programmed materials of any sort 
were to be used in their classes at any time during the term of the 
experiment. Also there were to be only teacher— demonstrations and 
no individual experiments performed by the students. 

The teachers were allowed to assign the science worksheets as 
homework in the control groups as they saw fit. These worksheets 
consisted of approximately ten questions pertaining to the respective 
lesson at hand. The student was sent to as many as eight resource 
books to secure information necessary for a response to a respective 
question. A copy of these worksheets may be found in Appendix A. 

The experimental groups received no specific homework assign- 
ments, but they were allowed to study the material at home if they 
so desired. The students were urged to do some studying as homework 
in order to all ow ma ximum class time for the performance of the 
laboratory experiments. 

Administration and Duration of Experiment 

The formal experimentation was begun at the Clarke County 
Junior High School, January 28, 1966. At this time the pretesting 
instrument was administered to both experimental and control groups. 
The Ruder General Interest Survey (Form E) was also administered. 
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The pretesting was preceded tyr an orientation for both control and 
experimental groups, with specific instructions delivered to the 
experimental groups in the use of the programmed sequence, the data 
they were asked to record, and the laboratory procedures to be 
followed during the experimental period. 

The experimental groups next were allowed to proceed with the 
programmed instruction. ‘Where a laboratory exercise was encountered 
by the student, he was instructed, within the programmed lesson, to 
proceed to the designated location in the laboratory. Here he would 
find equipment and supply items numbered to correspond with the list 
of needed supplies in the programmed lesson. The laboratory areas 
were under the guidance of the instructor who supervised laboratory 
activities and checked the outcome of experiments. 

Concomitantly, the control groups proceeded as usual with the 
teacher lecturing and calling for discussion when a problem needed 
solving. Former reference was made to the assigning of science 
worksheets containing questions relevant to the lesson at hand. 
Teacher-demonstrations were performed sometime during the time spent 
on a respective lesson. 

The investigator observed in all the experimental groups for 
some time every class period. Time was also spent in the control 
groups by the investigator and some demonstrating was performed by 
the investigator in the control groups. 

At the conclusion of the experimental period the posttest was 
administered to all eight sections. All eight sections were given 
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the K uder General Interest Survey (Form E) as a post experience, to 
evaluation questionnaire was administered to the experimental groups. 

A copy of this questionnaire may be found in Appendix B. 

With twenty school days allowed for the duration of the 
experiment, February 25, 1966 concluded the program. 

Sources and Types of Data 

The data needed were obtained from six main sources; (a) records 
in the Clarke County Guidance Department, (b) cumulative record cards 
from the principal 1 s office, (c) the administration of the Kuder 
General Interest Survey (Form E), (d) the administration of the unit 
pre- and posttest, (e) the administration of the evaluative question- 
naire, and (f) the record of the number of frames missed in each 
respective programmed lesson. 

Data from the Clarke County Guidance Department 

Test scores for all eight Clarke County seventh-grade classes 
on the California Achievement Tests, Junior High Level and the 
California Short-Form Test of Mental Maturity , Junior High Level for 
the October, 1965 testing were obtained through the Clarke County 
Guidance Department. Scores were obtained for each of the subjects 
as follows; 

The Cali a Achievement Tests , Junior High Level 

1. Reading vocabulary score 

2. Reading comprehension score 
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The California Short-Form Test of Mental Maturity , Junior 
High Level 

1. Language intelligence score 

2. Non-language intelligence score 

Data from Cumulative Record Cards 

Age, sex, grade level, and average of previous science 
grades were obtained from cumulative record cards. 

Administration of the Kuder General Interest Survey (Form E) 

In order to document any change in scientific interest as a 
result of the experimental project, the Kuder General Interest Survey 
(Form E) was administered at the beginning of the research project 
and at the conclusion. Scientific Interest scores were analyzed from 
this inventory on the subjects. 

Administration of the Unit Pre- and Posttest 

A specially-prepared test was administered before the beginning 
of the experimental period to determine students 1 prior knowledge of 
the respective curriculum content of the experimental instructional 
materials. The test consisted of 50 multiple-choice items. A copy 
of this test may be found in Appendix B. 

Both the pretest and posttest were administrations of the same 
test prepared by the investigator. No standardized achievement measure 
was available which contained a sufficient range and number of items 
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concerning the six selected, laboratory experiences chosen for the 
experimental curriculum. 

Administration of an Evaluative Questionnaire 

At the close of the study the classroom teachers administered 

an evaluative questionnaire to the experimental subjects. This 

questionnaire was given to attempt to evaluate the students 1 reactions 

to the programmed texts. The questionnaire was devised by the invest- 

* 

igator after having reviewed several questionnaires used for similar 
purposes in other studies. The students were encouraged by the 
teachers and the investigator to give their honest opinions. 

The two teachers were also asked to evaluate the experiment. 
Strengths and weaknesses of the materials were pointed out. The 
reactions of the teachers were comparable with the general reaction 
of their individual classes. 

Record of Number of Frames Missed 

Each student in the experimental groups was given a mimeographed 
form, asking for the recording of the number of each frame missed 
within each of the six programmed lessons. The teachers and the 
investigator stressed that this record in no way counted as a mark 
against the student, but rather that the student was furnishing the 
investigator with helpful evidence as to the reliability of a 
particular frame. 
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Description of the Tests 

California Achievement Tests , Junior High Level 

The new edition was re-standardized in 1963 to take account 
of faults in the 1957 norms. A sample of 341 schools in 48 states 
was carefully selected to insure obtaining an unbiased sample. For 
the junior high level, grades seven through nine, the reliability 

m 

coefficients for the various tests range from 0.83 to 0.95 with a 
reliability coefficient for the total battery of 0.98. These 
coefficients were computed using the Kuder Richardson formula 21. 

The standard error of measurement for the raw score is 9.4 points, 
for grade placement, 0.2 months. 

The item discrimination data are indicative of the efficient 
functioning of nearly all items. A high degree of concurrent validity 
is reflected in the high correlation of scores on the new edition and 
other standardized achievement tests (California Manual, 1963). 

< 

California Short-Form Test of Mental Maturity , Junior High Level 
This test yields three mental age scores— language, non- 
language, and total. The scores are also interpretable in the form 
of deviation I.Q. scores. 

Coefficients of reliability computed by the split-halves method 
and corrected by the Spearman-Brown formula are 0.93 for language, 

0.89 for non-language, and 0.95 for total. Standard errors of measure- 
ment for mental ages are 7.7, 9.1, and 6.3 months, respectively 
( California Manual, 1963 ) . 
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The validity claims of the publishers are based primarily on 
the original design of the test which evolved from a comprehensive 
analysis of the Stanford - Binet , ana tbe test 1 s high correlation with 
the individual Stanford-Binet and other intelligence tests. 

• Kuder General Interest Survey (Form E) 

The Kuder General Interest Survey (1964 Edition) was developed 
in response to a need for such an instrument for use with younger 

t 

people, particularly at the junior high level. It consists of ten 
scales, surveying interests in a wide range of activities. Among 
these ten interest areas is a Scientific Interest Scale. 

Normative data for the Survey were developed in the spring of 
1963, when the Survey was administered on a nationwide "oasis to a 

t. stratified sample of 6,950 boys and 6,565 girls in grades 6 through 

12 from public elementary and secondary schools in the United States. 

1 Approximately six weeks after administration of the Survey 

to the standardization sample, a randomly selected stratified sub- 
sample, consisting of 287 boys and 433 girls in grades 6-8 and 471 

‘ boys and 691 girls in grades 9-12, took the Survey a second time. 

Means, standard deviations, and test-retest correlations were 
calculated. Correlations were mainly in the 0.70s and 0.80s. 

i 

The Kuder-Richardson formula 20 was employed to check the 
internal consistency of the scales. Intercorrelations of the scales, 
for girls in the standardization sample, ranged from -.30 to +.37; 
for boys, from —.37 to +.35* Most of the correlations were close to 
zero. (Kuder Manual, 1964). 

1 er|c 
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Unit Pre- and Posttest 

The unit pre- and posttest was developed in anticipation of 
measuring pre- and post-experimental behavior in terms of both 
factual, science subject-matter competency, and the comprehension of 
scientific concepts, principles, and generalizations. The total 
test consisted of 50 multiple-choice items designed to cover the 
content of the curriculum of the six lessons in the unit. By jury 
validation, the 50 multiple-choice item test was judged to consist 
of 20 fac cual-oriented items and 30 conceptual-oriented items. The 
items were examined by the jury members for the specific fact or 
concept tested for in that item. 

The test items, themselves, were revised after having been 
administered to the sixth-grade students participating in the 
original pilot tests, when the reading difficulty was questionable. 
Suggestions from the jury reviewers were incorporated in revising 
specific test items. The two participating seventh-grade teachers 
at Clarke County Junior High School also reviewed the test items. 

The Test Scorer and Statistical Analysis 2 computer program 

t 

was used to gain information on the testing instruments."^ 

For optimum discrimination a difficulty index of 0.50 is 
desirable. Table 2 and Table 3 in Appendix C give indices of item 
difficulty on the pretest and posttest used in this study. There 
were no extremely easy or extremly difficult items identified on 
either the pretest or the posttest. A comparison of the item 
difficulties between the two tests showed the difficulty levels of 
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the posttest to be more consistent among themselves than the diffi- 
culty levels of the pretest. Of the 50 items on the pretest, 24 items 
were of medium difficulty, 26 items were more difficult with an index 
Oj. difficulty less than 0.40. Items are considered of medium diffi- 
culty with 40-50-60 percent passing. 

On the posttest, 33 items were of medium difficulty, 10 items 
had an index of less than 0.40, and seven items were easier than 

those considered of moderate difficulty, with an index greater than 

0 . 60 . 

An examination of Table 2 in Appendix C, showing the propor- 
tions choosing each of the four options for the fifty items on the 
pretest, demonstrates that each option was discriminating and served 
as a distractor. The proportion of the population choosing each item 
on the posttest is shown in Table 3. 

The results of correlating each item with the total test are 
found in Table 4. Table 4, in Appendix C, lists the item difficulty, 
the standard deviation of the item, and the point biserial correla- 
tion of each item with the total test score on the pretest. The null 
hypothesis is rejected at either the 0.05 level or the 0.01 level or 
both for all except eight items on the prebest. An examination of the 
pretest standard deviations of items reveals that the majority of 
items have a deviation near or at 0.50. 

Table 5, in Appendix C, refers to the same statistics for the 
posttest. Again item difficulty is shown along with the item standard 
deviation and the item-test correlation. 
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The null hypothesis is rejected at the 0.01 level of signifi- 
cance for all test items with the exception of item number 31 being 
significant at the 0.05 level, and items 11, !U, and 34 being 
nonsignificant. A total of 47 items out of 50 or 94 percent of the 
test items were significantly related to the total test score. Thus, 
94 percent of the items apparently made a contribution to the total 
variance of the test. 

Table 6 presents the mean, standard deviation, skewness, 
kurtosis, and standard error of each for the pre-f actual test, pre- 
conceptual test, pre-total test, post-factual test, post-conceptual 
test, and post-total test. The raw score mean of the pre-total test 
was 19.54 with a of 0.40, and a standard deviation of 5.92, with 
a of 0.24. Positive skewness was significant at the 0.05 level. 
Positive skewness is not unusual for a test in an area which has not 
been systematically pursued by students (i.e., by formal or informal 
classroom work). It is reasonable to suppose that a few students 
may have acquired some knowledge by independent study or informal 
inquiry, or even that an element of shrewdness in guessing (perhaps 
by narrowing four choices to two and improving the chances of a 
correct response without precise prior knowledge) may have permitted 
a few astuce persons to' obtain higher scores. 

The raw score mean of the pre-f actual test (total number of 
items = 20) was 7.86, with a Sj of 0.21, and a standard deviation of 
3.07, with a S^of 0.13. 

Pre-conceptual test raw score statistics for a total number 
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Table 6 



Unadjusted Keans, S-, Standard Deviation, , Skewness, 

S^ , Kurtosis, and S^ , for pre- and Posttests 

4 



Statistic 


Pre 

Factual 


Pre 

Concept 


Pre 

Total 


Post 

Factual 


Post 

Concept 


Post 

Total 


Mean 


7.86 


11.69 


19.54 


11.23 


15.70 


26.93 


S- 

X 


0.21 


0.26 


0.40 


0.24 


0.34 


0.53 


S.D. 


3.07 


3.84 


5.92 


3.43 


4.99 


7.90 


S <T 


0.13 


0.17 


0.24 


0.14 


0.20 


0.30 


Skewness 


,) 0.11 


0.27 


0.42 


-0.05 


-0.02 


0.00 


s «l3 


0.16 


0.16 


0.16 


0.16 


0.16 


0.16 


Kurtosis (<^ 


4 )-0.36. 


-0.18 


-0.51 


-0.69 


-0.60 


-0.70 


S c(4 


0.33 


0.33 


0.33 


0.33 


0.33 


0.33 



45 



of 30 items yielded a mean of 11.69 with a S- of 0.26, and a standard 

X 

deviation of 3.84, with a S^of 0.17. 

Results for the total posttest revealed a raw score mean of 
26.93, with *< S- of 0.53, and a standard deviation of 7.90, with a 

A 

S^of 0.30. Kurtosis was significantly negative at the 0.05 level, 
indicating a relatively platykurtic curve. 

Examination of post-factual test statistics disclosed a raw 
score mean of 11.23, with a of 0.24, and a standard deviation of 
3.54, with a ^ of 0.14. Kurtosis was again significantly negative 
at the 0.05 level, describing a platykurtic curve. 

Inspection of the post-conceptual test showed a mean of 15.70, 
with a 'S^ of 0.34, and a standard deviation of 4.99, with a of 0.20. 

Test reliability was computed by the Kuder-Richardson formula 
20. The reliability coefficient for the total pretest was 0.70. 
Treatment of data from the pre-f actual-oriented subtest yielded a 
reliability coefficient of 0.58. The pre-conceptual-oriented subtest 
had a reliability coefficient of 0.57. 

The reliability coefficient obtained for the total posttest 
was 0.84. ?ost~f actual-oriented test reliability was 0.68; the post- 
con ceptual-oriented subtest had a Kuder-Richardson coefficient of 
0.74. A copy of the total test may be found in Appendix B. 

Treatment of Data from Evaluation Questionnaire 

In order to determine the students * reactions to the program- 
ming method, chi-square values were calculated from the responses to 
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the Evaluation Questionnaire. 

The first ten questions provided three to five cells for the 
student 1 s optional choice. The last two items on the questionnaire 
asked for a discussion of least-liked and most-liked features of 
the programmed laboratory materials. Representative comments from 
answers to items 11 and 12 are reported in Appendix D. 

The questionnaire is included in Appendix B, and referred to 
by question number on Table 7, which summarizes the results of the 
analysis of the first ten questions of the questionnaire. For 
purposes of analysis the first two and the last two response 
categories are combined. The combined first two categories are 
referred to as the favorable category in all questions except number 
three, number six, and number eight in which the combined last two 
categories are referred to as the favorable category. The combined 
last two categories are referred to as the unfavorable category in 
all questions except numbers three, six, and eight in which the 
combined first two categories are referred to as the unfavorable 
category. 

The hypothesis of independence of the response categories and 
classes was tested by computing chi-square for each of the ten questions 
on the questionnaire. If the hypothesis of independence is true, then 
the proportions of response in each of the three categories will be 
the same regardless of the class involved. 

An examination of Table 7 which compares the favorable, indif- 
ferent, and unfavorable response categories, reveals that only question 
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Tablo 7 

A Corparinon of the Favorable, Indifferent, and Unfavorable 
responses to the Uvaluntlvc Questionnaire 
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Question 
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four has an equivalence of response. The preponderant favorable 
responses, to nine out of the ten objective questions on the 
evaluative questionnaire, were highly significant at the 0.01 level 
of significance. 

Treatment of Data from Programmed Materials 

Each of the six programmed, laboratory-experience lessons was 
checker to identify specific frames which elicited an incorrect 
response from the experimental subjects. A mean number of correct 
responses was computed for each of the six lessons, as well as the 
standard deviation. Lesson 1, "A Gradient; The Effect of pH on least 
Activity 11 , had a total of 77 frames with a mean number correctly 
answered of 71.38, and a standard deviation of 8.86. 

For a total of 67 frames in Lesson 2, "A Gradient for the 
Separation of Chemical Substances; Paper Chromatography 11 , a mean of 
63.04 was obtained with a standard deviation of 7.07. 

A mean of 98.62 resulted from the analysis of the frames answered 
correctly in Lesson 3> "Normal Curves and Warping Factors". The- total 
possible number was 104 frames. The standard deviation was 10.75. 

Lesson 4, "The Use of Normal Curves in Distinguishing Species", 
had a total of 89 fr jaes with a mean number correct of 84.61, and a 
standard deviation of 9.44. 

"A Study in Measurement", Lesson 5, was 182 frames in length. 
Comparatively, Lesson 5 was approximately twice the length of the 
other five lessons. Inasmuch as the use of the microscope was 
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necessary in this exercise, abundant illustrations and instructive 
frames were included in Lesson 5 > along with the laboratory experiment 
itself. The mean number of correct frame responses was 175.53, with 
a standard deviation of 17.86. 

For a total of 73 frames in Lesson 6, "Measurement as an 
Expression of Relationship? A Simple Balance", the mean correct 
was 70.22, with a standard deviation of 7.25. 

Table 8, a correlation matrix for the six programmed, 
laboratory lessons, can be found in Appendix C, page 209. 

'Statistical Analysis 

Analyses of variance and covariance were used to an aly ze the 
data. Because of disproportionate numbers in the groups being 
considered, a least— squares method of fitting constants was employed 
(Winer, 1961). Scores on reading vocabulary, reading comprehension, 
language intelligence, non-language intelligence, science average, 

Kuder General Interest Survey (Form E), the Unit Pretest, teacher, 
and sex were evaluated as to their effect on posttest scores with 
regard, to factual and conceptual competencies in science. The 
categorized ability levels pointed out earlier were also covariates. 

This is a treatment-by-lcvels statistical model in which the 
levels of the variables sex, ability, and achievement are not 
randomly selected. Use of the analysis of covariance technique allowed 
control over predicted differences between the experimental and control 
groups on the dependent variables.. By adjusting for pretest differences, 
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this technique had the effect of controlling the students 
statistically. 



CHAPTER IV 



RESULTS 



The least-squares analyses indicate the relationships between 
achievement as measured by the factual-oriented posttest, the 
cone eptual-or i ent ed posttest, and the total posttest scores, and 
the subject variables considered. They also indicate the relation- 
ship of scientific interest and the subject variables considered in 
the study. The 0.05 probability level was the criterion for 
rejecting the null hypothesis. By use of F tables, it was determined 
that for 200 degrees of freedom, an »F" ratio of 3.89 and 6.76 are 
significant at the 0.05 and 0,01 levels of significance, respectively. 



Achievement as Measured by the Posttest Factual Scores (Table 9) 

1. Evidence obtained from the analysis of the posttest factual 
scores of the experimental group and the control group 
indicates that the control group achieved at a significantly 
higher level than the experimental group (F = 9.33; p < 0.01). 

2. Although the teacher variable did not prove to affect, 
significantly, the achievement of students, the inter- 
action effects of treatment by teacher proved to be 
significant in favor of teacher one with the control 
group and teacher two with the experimental group 

(F = 4.68; p C 0.05). 

3. Scores earned on the pre-factual test proved to be related 
positively to achievement on the post-factual test 

(F = 5.37; p ^ 0.05). 

4. The relationship of science grades assigned by the 
seventh-grade science teachers for the first two quarters 
of the academic year, 1965-1966, proved to be significant 
(F = 12.86; p < 0.01). 

5. Reading comprehension scores were positively related to 
achievement in the experimental program (F = 7.28; 

p < 0.01). 



